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AESTBACT 

A basic assumption cf this study is that the meaning 
continuum registered by an adjective pair remains relatively constant 
over a large universe of concepts and over subjects within a 
relatively homogeneous population. An attempt was made to validate 
this assumption by shewing the invariance of the factor structure 
across different types of subjects, and across different classes of 
concepts. The existence of systematic response tendencies independent 
o.t the meaning cf the adjectives was demonstrated. Although the 
substantive results agreed generally with the Evaluation, Activity 
and Potency dimensions cf earlier research, notable differences were 
feund; specifically, that a clear separation between a hedonic 
response and a judgment of values seems to exist. The data from the 
60 developed scales suggests seven useful composites which may be 
used fairly confidently (with similar samples) to measure attitudes 
without the need to factor analyze the scales for the new data. (TA) 



U.s. DEPARTMENT OF HEALTH. EDUCATION 
& WELFARE 
OFFICE OF EOUCATI ON 

this document has been reproduced 

EXACTLY AS RECEIVED FROM THE PERSON OR 
ORGANIZATION ORIGINATING IT. POINTS OF 
VIEW OR OPINIONS STATED DO NOT NECES- 
SARILY REPRESENT OFFICIAL OFFICE OF EDU- 
CATION POSITION OR POLICY. 



r- 

sO 
« — i 


STANDARDIZATION OF SELECTED SEMANTIC DIFFERENTIAL 
SCALES WITH SECONDARY SCHOOL CHILDREN. *+ 


CD 

UD 

CD 

CD 

Ui 


G. T. Evans 

The Ontario Institute for Studies in Education 


62 


The Semantic Differential Model 

A semantic differential scale consists of a pair of adjectives 
of opposite polarity. Its use with a particular person requires that 
he react to a given "concept 0 in terms of these adjectives by placing a 
mark at an appropriate point on a line drawn between the two bipolar terms. 

In the usual 7-point form of the scale the value 4 is regarded as 
neutral, while values of 1 and 7 are regarded as extremes. 

The semantic differential scale has been elaborated following the 
work of Osgood, Suci and Tannenbaum (1957) into an instrument for recording 
affective responses to stimuli, for measuring "meaning" in a generalized 
sense. It is assumed that the verbal Judgment responses can be character- 
ized in terms of a few idealised responses known as semantic dimensions, 
which, when regarded as linearly independent vectors, are said to span the 
semantic space. A review of the methodology of the semantic differential 
has been recently made by Kelse (1969) . The purpose of this section is to 


in 


extend and formalize some of Heise’s observations. 
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The model considered below refers to some defined population of 
persons, and is thus a statistical model in that its parameters refer 
only to constructs defined over such a population, and are not defined 
by the responses of a single person. One version of the model Is given 
by 



( 1 ) 






+ + S J ' 



,k » 1, , v, 

J a 1* ...» n. 



O 

ERIC 



where x^j is a random variate interpreted as the response made by a 
randomly drawn subject on the j.th scale to the kth concept, and f ^ ia 
the hypothetical response made by the same subject to the kth concept 
on the pth idealised semantic dimension. The parameter is a regression 
weight, indicating the regression of the jf.h scale on the pth semantic 
dimension, a random variate indicating the extent to which the 

semantic dimensions do not account for the random subject’s response on the 
jth scale for the kth concept. The terms and B^ are scaling constants 
associated with the jth scale which may be used to adjust its mean and 
dispersion. 

In order for the model to be useful, it in necessary to place 
certain restrictions on the distributions of the quantities x^j » f^, and e^. 
The last two are assumed to have expected values of zero and variances of 
unity, and 0^ may be chosen so that E(x^^) is aero. Further, it is supposed 
that error terns for different concept-scale combinations are uncorrelated 
with one another and with the "factor" variables, f j,, p * 1, ..., m; 
k *> 1, v, which refer to the various combinations of concepts and 

semantic dimensions. The strict assumptions of the statistical model of 
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factor analysis also require that the three sets of variates each have a 
multivariate normal distribution. la utilizing the model, a two stage 
strategy may be used. The first stage consists in estimating parameters, 

*jp* 3 " 1» •*»» h; p » 1, m, for the population. The second stage 

involves measuring the responses, f „ on the idealized semantic dimensions 
for groups of persons and concepts. The second stage depends in the 
standardization involved in the first. 

To estimate parameters like a it is necessary to draw a sample 
of, say, N persons from the population of interest and to select v 
appropriate concepts. If the model were treated analogously to that for 
factor analysis the n data for each person-concept combination would 
constitute a single multivariate observation, and there would result an 
n x n correlation matrix based on vN observations. This is Heise's 
(1969) Design 2 for an ad hoc factor analysis of the scales. Helse 
discusses the considerations underlying the choice of both persons and 
concepts. 

More frequently, a different interpretation is placed on the model, 
and the v concepts, instead of being regarded as fixed effects, are 
regarded as random observations from a population, or sub-population, of con- 
cepts. In this case the model may be written. 
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where now refers to the response on the jth scale made by a randomly 
drawn subject to a randomly drawn concept. 
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To estimate the parameters like a^, it has frequently been the 
practice to find the average rating over all N subjects for each scale on 
each of the v concepts in a sample of concepts. In this way the variability 
of the ratings reflects only concept variability, and the replication across 
persons merely serves to stabilize the observations . Tills is Heise’s (1969) 

Design 1 resulting in an n x n matrix of correlations between scales based on 
u observations. The use of this design seems difficult to justify in terms of 
the model, and its success probably depends on the fact that individual differ- 
ences between subjects using the same scales fox" the same concept are generally 
much less than differences between ratings on concepts judged by a single subject 
using the same scale. Use of statistical factor analysis models are even harder 
to justify in this design, since the sample of concepts may rarely be considered 
as random, but is chosen to span the semantic differential space (See Heise,1969). 

By analogy with factor analysis, the most direct way of estimating the 
pattern coefficients, a. , would be to draw a sample of person-concept combina- 
tions from all possible combinations in the population of persons and universe 
of concepts. This would be a very inefficient procedure in practice, but it 
can be approximated by choosing a large number o£ concepts (say over 100) and 
allocating a small umber of concepts (between 1 and 8) at random from the 
list to each subject. If 100 subjects were used, each rating a sample of 
4 concepts with say 50 scales, there would result a 50 x 50 matrix of scale 
Intercorrelations, based on 400 observations. This design enables 
estimation of parameters for given populations of persons and concepts. If 
the samples from either population were chosen carefully, it would seem 
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reasonable to use these estimates of parameters, without further factor 
analyses, in any further work using subsets of either the person or concept 
population- We will call this Design 3. 

The factor variables suggested by the factor analysis are hypotheti- 
cal constructs and the factor score, £ ^, for any particular person, and any 
particular concept cannot be precisely deter mined. It is, however, possible 
to constrvsct operational variables, linear weighted composites of the observed 
ratings, whose statistical distributions approximates in various ways, the 
distributions of the factor scores. One method of doing this Is to utilise 
the various formulae for "estimating" factor scores, using the estimates of 
the pattern coefficients as if they were known parameters. (See Harman, 

1967, p. 345 j Harris, 1967, and McDonald and Burr, 1967). These methods 
may only be used, of course, when all scales of the original standardization 
are used. 

When the matrix o£ pattern coefficients contains only one substantial 
entry per row, the various scales fora well defined, distinct clusters. This 
type of solution, most likely achieved by the. use of an appropriate oblique 
transformation of the initial factor solution (Harman, 1967, p.273, Harris and 
Kaiser, 1964 ) , indicates a direct link between the factor variables and the 
clusters of observed variables. Under these circumstances, a simple 
addition of ratings on scales in the same cluster may produce sets of scores 
which are highly correlated with appropriate factor variables. This is, in 
any case, the method advocated by Osgood et al (1957). The task of the 
factor analysis becomes one of clustering scales according to their degree 
of correlation, and the pattern matrix indicates which scales can reasonably 
be combined to provide measures of general semantic response tendencies. 
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How many scales should be combined to produce a variable which reflects the 
common factor of the scales depends on the siase of the pattern coefficients. 
The best guide to the number needed would be a measure of the internal con- 
sistency of the resulting composite, as measured by coefficient alpha (Lord 
and Novick, 1968, p. 87 ). 

An important question in the interpretation of the model is that 
concerned with concept-scale interaction. In terms of equation (1) , it may be 
that two investigations using different sets of concepts but the same set 
of scales, give rise to very different factor constructs. Or it may be 
that the same factor constructs emerge but the loadings of some scales 
change their allegiances. Heise (1969) has given an excellent discussion 
of this problem and shows that such concept scale interactions may arise 
because of three artif actual reasons and two true conditions. An 
additional artif actual reason may be that, factor solutions being essen- 
tially indeterminate, the pattern matrices from the two studies have not been 
optimally matched (Evans, In Press). 

In terms of the models above, concept-scale interaction may be 
characterized as within a given population of concepts or as between differ- 
ent populations of concepts. In the first case, use of Design 3 simply 
eliminates the possibility of noticing a concept-scale interaction, since 
only a single estimate of the pattern coefficients is made. This result 
gives rise to one difficulty. Adjective pairs like sharp-dull, which 
change meaning according to the concept rated, are a particular source of 
trouble if the population of concepts is very heterogeneous. Concept-scale 
interactions observable between different classes of concepts may be dealt 
with according to Heise' s suggestions. 
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Response Tendencies 

Variations in the extent to which subjects are prepared to utilise 
the full range of the scale In responding are well known (Peabody, 1962, 
Arthur, 1966). He! sc (1969), in his review, and Kahneman (1963) describe a 
mechanism by which this kind of response bias, different within-subject 
variances of ratings, can affect the structure of the correlations between 
scales, particularly when only one concept is involved. Where many concepts 
are involved, it is not clear what will be its effect. It is convenient to 
designate this variable as response dispersion and to define it formally for 
a given subject, i, as d^, where 



where x, , . is the response of the ith subject to the kth concept using the 



jfch scale, and is the mean for that subject over all S scales and C 

concepts . 

It would be desirable to assess response dispersion independently of 
the data used to estimate the pattern coefficients of a particular set of 
scales. For this reason the set of S scales and C 'concepts referred to In 
formula (3) should be selected for just this purpose. 

Another type of systematic bias on which there may be individual 
differences is the response centering , the mean response for a given subject 
referred to in formula (3) , given by 
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Messick (1957) demonstrated a median tendency for subjects to displace the 
centre point of the scale. 

The Standardisation Study 

The aims of the study may foe described in terms of the above model 
as follows: 

(i) To recheck the feasibility of wsing semantic differential 
ratings with secondary school students. 

(ii) To study the reliability of semantic differential judgments 
for the above population of students. 

(iii) To provi.de standardized estimates of the pattern coefficients 
of the model by using; a large sample of students and a large sample of 
concepts for a particular school population. The population of concepts 
used was English nouns „ and about 1,000 of the most frequently used nouns 
were Included in the list. 

(iv) To study the effects of individual differences in methods of 
responding on the factorial structure. 

(v) To study the nature of the semantic dimensions for the above 
person-concept population, and to suggest operational measures. 

(vi) To study the stability of the factor pattern across different 
sub-groups of the sample; boys, girls; grade 8,, grade 10, grade 12. 

(vii) To study the stability of the factor pattern across a random 
subdivision of concepts and persons. 

(viii) To study the stability of the factor pattern, across a-prio ri . 
divisions of the concepts into different categories. 
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METHOD 



Ten high schools in Ontario and their companion junior high schools 
were selected from electoral districts ehoaen at random to he representative 
of metropolitan, urban, and rural districts, providing a sample or tenj sets 
of grade 8, grade 10, and grade 12 students, a total of 843 students. 1 A 

I 

total of 1,080 mourns of highest frequency were chosen from the Thorndike and 

| 

Lorge Teachers Word Book <1944) md split at random i \to two groups*; From 

/ 

each list* sets of 4 random words were generated without replacement^, the 

/ 

lists re-randomized, md the process repeated* The sixtv scales we/fe chosen to 

t 

represent a larger list of 340 scales compiled from Roget f s Thesaurus and 

other sources. The sixty scales were listed in random order and t$ie polarity 

$ 

was randomly alloted. For further details of the method of chooa^ng the 
scales sec Evans (1970). The actual adjective pairs are listed lfa Table 3. 

To measure response dispersion, response centering, and &cale relia- 
bility* two sets of sift concepts ware chosen from Heise f s (1965)/ list in such 

> 

i 

a way as to have large variance and zero mean on their semantic! coordinates 
for evaluation (E) * activity (A), and potency (?) t according C<j Heise's esti- 

j 

mates* In the same way, two sets of adjectives were chosen, eueh set 

i. 

containing two adjectives as measure© of each EPA dimension* ^These 72 
combinations were presented, with random polarity, along with^ the main task, 
and 24 of these combinations were repeated In scrambled order| after the main 
task. Details of the response bias measures are presented ii| Table 2* 

I 

The main t&sk for each subject was to make 240 ratings, 60 for each 

f! 

of 4 rmuto&ly chosen concepts. The complete set of 336 combinations were 

, ;( 

presented in bookies: form. Because of the way in which the .‘lists of 4 concepts 
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were generated, it was possible to designate subjects as having concepts 
from one random group or the other. Even-numbered subjects were given 
concepts from one list, odd-numbered subjects, concepts from the other. 

The booklets used consisted of 12 "dlgitefc" sheets, specially 
prepared for this sUidy and later machine processed. All except the 
second page contained 30 adjective pairs; the second page contained 6. 

The first 36 items comprised Set 1 of the response bias combinations. 

Next followed four sets of sixty scales which referred to four different 
concepts, one concept for two pages. In this case the concept was typed 
at the top of the page in the following manner 

GRANDFATHER (He is their Grandfather ) . 

Heise’s (1965) method of using an affectively neutral sentence to 

elaborate the meaning of the concept was adopted. Each subject received a 

different set of concepts which were numbered for later identification. 

Next followed Set 2 of the response bias measures and the 24 repeated items. 

The students worked in groups of either about 30 or about 60, under the 

supervision of a trained research assistant. Before beginning the task, 

each group was instructed in the use of the semantic differential, by means 

of separate practice sheets, and In the use of digifcek responses. No time 

limit waa made, but an hour usually proved more than 'sufficient. 

All response sheets were machine scored, and the data stored on 

magnetic tape. Because of omissions and other errors, only 800 of the 843 sets 

of data were usable.. The ratings were initially scored between 1 and 7, the pole 

corresponding to a ”7" rating in. each adjective pair In placed first in Tables 2 

and 3. These poles were initially intended to reprcs&nt a. more positive 

1 

evaluation. * more activity, or more potency. 

O 
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ANALYSIS OF RESULTS 

The 24 repeated concept-scale combinations provide data on short 
term test-retest reliability, the correlations being shown in Table 1. 
These range in value between , 37 and .73. For combinations of, say, six 
scales, use of the Spearman Brown formula indicates the corresponding 
reliabilities would range from .80 to .94, and are thus very promising. 
Other reliability data of the same kind have been reported by Evans (1970) . 

Insert Table 1 about here 



Two measures of response bias were calculated. For each subject, 
the response centering and response dispersion for each set of 36 bias 
scales were found. The correlation between the two measures of response 
centering was .47, and that between the two measures of response dispersion 
was .59. Other statistics are provided in Table 2. These bias measures 
thna appear to represent fairly consistent individual differences in subject 
response tendencies. 



Insert Table 2 about here 
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Before preceding to the main analysis, it is useful to study the 
response characteristics for the 60 scales being standardised. Each scale 
was vised to rate four concepts by 800 subjects, giving rise to a distribu- 
tion of 3,200 ratings on the scale. The mean and standard deviations of 
each, of these 60 sets of ratings may be rased to detect any abnormalities 
in the use of any particular scale. The means of the 60 ratings ranged 
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from 3.2 for "humble-proud" and 3.8 for "simple-complex" to 5.2 for "good- 
bad" and "valuable -worthless" and 5.0 for "meaningful-meaningless". The 
grand mean was 4.54 and the standard deviation of the mean was 0.37. 

The average standard deviation for the 60 scales was 1.76, and standard 
deviation, 0.13, che range being from 1.5 to 2.1. It thus appears that, while 
there is a systematic bias towards the more pleasant, valuable, active, and 
potent pole of the scales, no set of ratings was very skewed or leptokurtic. 

The main analyses were concerned with the factorial structure of the 
60 scales. The first analysis utilized data from the 60 scales and all 800 
subjects. The 3,200 observations were used to compute a .60 x 60 matrix of 
correlation coefficients between scales. This was factor analysed using 
Thomson's least squares iterative technique (See Harman, 1967, p.135). The 
number of factors was set at 10, the number of principal components extracted 
in the first iteration with latent roots greater than unity. The factor 
matrix was transformed by varimax (Kaiser, 1958), Proraax (Hendrickson and 
White, 1964), and Schmid and Leiman (1957) programs, and each of the alterna- 
tive solutions compared. The varimax solution was chosen as being most useful, 
since it provides the same information as the other two, and was in this 
instance more easily understood. It is presented in Table 3. 

Insert Table 3 about here 



The adjective pairs in Table 3 have been rearranged from their 
original random order of presentation to bring out more clearly the nature 
of the factors, and lines have been drawn to group the scales as far as 
possible into clusters, although some overlapping is apparent. The first 
seven of the clusters correspond very closely with the main oblique factoirs 
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of the- prottfijc analysis. 

This table is central to the paper, since all other factor 
analytical results were compared with it, including those derived from 
the. ratings after they had, been adjusted for person response bias. Before 
we proceed to discuss the factors, the effect of response bias on the 
factor structure may be thus fairly quickly dismissed. The second factor 
analysis was performed in exactly the same way as that above, except that the 
data for each subject were adjusted by subtracting the constant fro® 

each rating end dividing the result by the response dispersion d^. In this 
way the data for each person were adjusted for independently estimated 
retipcinse biases. The results of the factor analysis of the adjusted ratings 
were than compared with the coefficients in Table 3. 

Before such a comparison is made an attempt should be made to 
maxlutite the similarity between the two matrices, so as to eliminate 
ar til: actual dissimilarities due to using different bases. A convenient method, 
in this case,, is to regard the matrix based on the unadjusted ratings as a 
standard and to rotate the second matrix to maximum congruence with it. 

(See Cliff, 1966, Schonemiin, 1066, Evans, In Press) . 

When this has been done the two matrices may be compared factor by 
factor. Two measures of agreement between the factor coefficients may be used-* 
Tucker's (1951) coefficient of congruence (t), which has a maximum of 1.0, 
and the root mean square difference between corresponding coefficients (6). 

Each of these Indices is shown for each factor in Table 4. The first two rows 

Insert Table 4 about here 
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of numbers refer to the match between the factors for the adjusted and 
unadjusted ratings. Congruence coefficients (t) greater than 0.9 indicate 
good agreement, those better than 0.8 usually indicate sufficiently good 
agreement to accept the factor variables as referring to similar dimensions 
of attitude. Low values of the root mean square difference (6), less than 
0.05, say, indicate not only approximate proportionality between the two 
columns of factor coefficients, but, in addition, a high level of absolute 
agreement. Of the two indices, the congruence coefficient is to be preferred 
because of its relative standardization. 

In terms of both criteria, there is evidently very close agreement 
between the factorial structures derived from the unadjusted and adjusted 
ratings. For this reason, in all other comparisons of structures, response 
bias corrections have not been made. 

Comparisons of sub-samples of persons with the total sample were 
also made. The measures of factorial similarity with the total sample 
(matrix A Q ) for boys (matrix A 2 > , girls (A^) , Grade 8 (A^) , Grade 10 (A^) , 
and Grade 12 (A^) are shown in Table 4. In each case the extent of agree- 
ment is high, even taking into account the fact that the sub-sample is 
included in the total sample. This finding is in line with the many reported 
findings of invariance across different samples of persons in semantic differential 
dimensional structure. It does, in this case, give evidence of the generality 
of the results throughout the population of persons sampled. 

Invariance of the semantic differential dimensions within different 
classes of concepts has not always been demonstrated — rather, the reverse 
has been the case, giving rise to the notion of concept-scale Interaction. 

To study this type of Invariance or interaction, the list of 1,080 concepts was 
divided in two ways. First a random division of concepts was made, as 
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explained i a the procedure, and the two resulting lists used for a random 
division of subjects (odd and even). The similarity of the factors 
(A j and. Ag) resulting from this division with those for the total sample 
are shown in Table 4. Again the agreement is high, as might be anticipated. 

The second division was baaed on an a priori classification of the concepts. 

Each of the 1,089 concepts was allocated, by three independent judges, to one 
of six concept classes. Disagreements between judges amounted initially to 
51 nouns. The disagreements were discussed and a final consensus allocation made, 
except for two concepts. The classes of concepts used were as follows: 

(1) Persons: the referent of the noun is a person, class of persons, 

or group of persons.. For example, nurse, stranger, crowd. 

(436 observations) . 

(2) Concepts pertaining to the human body. For example, birth, 
shoulder, wound. (124 observations). 

(3) Substances or things grown, mined, made, collected, hunted, by 
people in this culture — food, clothing, buildings, transport, 
communications, works of art*, etc. For example, jewel, shop, 
rug, highway, butter. (844 observations). 

(4) Abstract concepts associated with human endeavour. For example, 
news, title, fun, mystery, campaign, scorn. (1133 observations). 

(5) Maturally occurring things and animals. 

Nature at large. For example, plant, sky, lake, mouse. 

(315 observations) . 

(6) Relationships t The referent of the noun is an abstract relation- 
ship. For example, corner, form, connection. (340 observations) . 

15 




Most disagreements between the judges were In categories 3 and 5. 

'Che correlation matrices for ratings of each concept type were compared 
with the correlation matrix for the total person-concept sample. For 
concept types l» 3, 4, and 6 the agreement was very close, Indicating a close 
similarity of factorial structure. For two types, however, there were large 
enough disagreements to warrant a separate factor analysis. These were 
concept types 2 and 5. Measures of similarity between the factor matrices 
corresponding to these two concept groups (Ag and A^ Q ) , and the target matrix, 

Aq, are presented in Table 4. 

The largest disagreements are with concept type 2, for which the only 
acceptable coefficients of congruence refer to factors I, V, and VI, factors 
of general affect, activity, and potency. A visual inspection of the coeffi- 
cients also indicates some degree of similarity for factors II, HI, IV, and VII. 
Concept type 5 shows a greater measure of agreement, the congruence coefficients 
reaching acceptable values for all but factor VII, Ease, and Factor X. 

The discrepant factors are also minor in terms of the amount of variance 
accounted for. 

Thus, as far as concept classes are concerned, there is evidence of 
general similarity particularly within dimensions yhich are similar to 
Osgood's EPA factors, but it is clear that some classes of words lead to 
noticeable discrepancies. With more narrowly defined classes, the discrepan- 
cies could no doubt be greatly Increased. The interaction between concept 
class and factorial structure appears, in this case, to be due more to lack of 
scale relevance than any other condition. Judgments of competence, success, 
ease, meekness, or value would probably be considered irrelevant by most of 
the subjects for parts of the body. The data are currently being analyzed 
more intensively to study the factorial shifts associated with various types 
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of concept. 

The Factor Variables 

The. coefficients in Table 3 may be used in the usual way to provide 
a tentative description of the semantic response dimensions which may* on the 
basis of these data* be hypothesized to under ly the semantic differential 
scales. First, it should be noticed that many scales have quite low 
communalities — the average is .40 — indicating a fair amount of specificity. 

If factor scores are estimated, or scales in other ways combined, it is 
essential to check the internal consistency of the composite. Second, in the 
varimax solution, there is a high degree of overlap among the scales which 
define the factors. For this reason if scales are simply added, it will be 
difficult or impossible to produce even approximately uncorrelated composites. 

The first factor corresponds closely to what has been termed evalu^ti*-™ 
(Osgood et al, 1957), but the scales most closely related to the judgment of 
value ("'right-wrong", "good-bad", "valuable-worthless"), also load on the second 
factor along with "wise-foolish", and "meaningful -meaningless". In the oblique 
analysis these five scales are separated from the rest of the scales in factor I. 
So, too, are the scales vdiich. load ori both Factor 1 and Factor 111 ("contented- 
discontented", "successful-unsuccessful", "sure-uncertain", "rich-poor"). For 
this reason. Factor 1 has been named "General Affect," suggesting that it 
represents a general affective response of liking vs. disliking. The variable 
is defined principally by the scales "pleasant-unpleasant", "friendly-hostile" „ 
"pleased-angry", "kind-cruel", "beautiful-ugly", "comforting-frightening", 
"gentle-violent", "delightful-dreadful" , "happy-sad", "loving-hating" , and 
"sweet-raour ' » all of which have coefficient!! greater than .60, and all. of which 
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are defining scales for the corresponding oblique factor. The scale 
"good-bad" has a coefficient of .61 but is not a defining scale for 
the oblique factor sad it also has a high loading on varlmax Factor II. 

The hedonistic interpretation of Factor I differs from that made of 
similar factors by many other workers, but for the data presented here, there 
is a clear need to differentiate between a pleasure-pain response and evalu- 
ation. However, Factor I is moderately correlated with scales which 
apparently measure value and also with ocalaa which are measures of activity. 
The name. "General Affect" has therefore been used. 

Hie judgment of value characterises Factor II, while Factor III 
correlates highly with scales suggesting care and competence. It is 
noteworthy that "anxious", as opposed to "carefree", is perceived to be at 
the positive end of this factor. The first four factors appear to span a 
cluster of judgments which may be broadly described as evaluative in the 
sense used by Osgood et al. Keise (1969) reviews arguments and evidence 
that the use of ratings of individual subjects does tend to result in a 
splintering of the evaluative dimension. This has certainly occurred here. 
The split between the hedonic and value factors also agrees in part with 
the results of Komrifa and Bass (1967) <. 

The activity and potency factors reported in many semantic differ- 
ential studies are presented here also in Factor V and Factor VI. Seven of 
the scales which define the Activity Factor appear to refer more to pernonol 
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energy or excitement than to physical activity as such. There is also 
some overlap with the general affective factor with five of these seven 
scales. The scale "hot-cold" does not load on this factor. The potency 
factor differs from that frequently obtained in that the scales "rough- 
smooth" and "hard-soft" are not even moderately highly correlated with 
other scales which define the factor. Factor VII clearly discriminates the 
easy from the difficult. It is of interest that "familiai^-strange" and 
"simple-complex" evoke a similar response to "easy-difficult," justifying, 
from the students’ point of view, traditional maxims of teaching. Factor 
VIII appears to discriminate humility and unselfishness from their judg- 
mental opposites, and is named "Meekness". The two remaining factors involve 
only three adjective pairs and are difficult or impossible to interpret. 



Summary and Discussion 

A central assumption in this study was that the meaning continuum 
registered by an adjective pair remains relatively constant over a large 
universe of concepts and over subjects within the relatively homogeneous 
population tested. Real shifts in meaning of adjectives of the kind 
discussed by Heise (1969) w«:re regarded as rare. Variability in ratings 
was assumed to arise, not from changes in meaning of the adjectival pair, but 
from different responses to different concepts by the same person, and from 
different responses to the same concept by different persons. This assump- 
tion gives the adjectival scales a constancy of interpretation in research 
with the particular population of subjects, without which their use would be 
logically difficult. It also enables the model described at the: beginning of 
of the paper to be used in the search for useful clusterings of adjectival 
meanings, which might reflect some underlying semantic generalities within 
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the population studied. 

An attempt was made to validate this main assumption by showing 
the invariance of the factor structure both across different types of 
subjects and across different classes of concepts. It was suggested that the 
small departures of two concept types freat the general structure could be ex- 
plained by the fact that some of the scales were irrelevant for these kinds 
of concepts. This would not invalidate the general use of the scales, although, 
it might obviously result in. a loss in efficiency in souse instances. 

The existence of systematic response tendencies independent of the 
meaning of the adjectives were demonstrated. Such tendencies are important 
for interpreting individual results, and a set of 72 items which fairly 
reliably measures response dispersion was presented. Response bias appears 
to have little effect on the factorial structure of semantic differential 
scales. 

The general substantive results agreed generally with the Evaluation, 
Activity, and Potency dimensions of Osgood et al (1957) and others, but 
notable differences were found. In particular, it is argued that there is 
a clear separation between a hedonic response and “a judgment of values. The 
data from there 60 scales suggest seven useful composite scales which may be 
formed from the following adjectival pairs, by simple addition. 

General Affect: pleasant-unpleasant, kind-cruel,, friendly-hostile, 

delightful-dreadful, pleased-angry, beautiful- 
ugly, loving-hating, comforting-frightening. 

Value : good-bad » meaningful-meaningless , valuable-worthless » 

right-wrong. 

Success: sure-uncertain, successful-unsuccessful, contented- 

discontented , unruffled-embarrassed. 
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Competence: 

Activity: 

Potency: 

Ease; 

Such composites eotild fairly confidently be tused for samples similar 
to that described above , to measure attitudes and differences in attitudes 
to various concepts, 'without the need to factor-analyze the scales for the 
new data. 



alert-dreamy, educated-ignorant, careful-careless, 
groan-up-childish , wise-foolish. 

Excited-bored, moving-still, fast-slow, fresh-stale, 
changing-permanent , young-old, up-to-date - out-of-date. 

strong-weak, large-small, deep-shallow, brave-cowardly. 

familiar-strange, easy-difficult, simple-complex. 
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TABLE 1 



RELIABILITY OF SEMANTIC DIFFERENTIAL SCALES 
CORRELATIONS BETWEEN REPEATED ADMINISTRATIONS 



Concept 


* 


S 


c a I e 










i 


2 


3 


4 


5 


6 




Good 


Beautiful 


Fast 


Hot 


Strong 


Thick 


1 . tt&Klv&t 


.67 


*73 


.55 


.55 


.65 


.61 


1 

2 . Eaei ey 


.53 


.56 


.43 


.44 


.59 


.50 


3* Laugh 


*54 


.54 


.42 


.49 


,38 


.47 


4* Iron 


,40 




.59 


.39 


.52 


.37 


.46 
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TABLE 2 

RESPONSE BIAS MEASURES, HEARS, STANDARD 
DEVIATIONS , AND CORRELATIONS 



SET 1. (36 scale- concept combinations) 

SAroesMffiAca ' 

Scales ? Good-bad., beautiful-ugly, fent-slew, hot-mid, strong-weak, thick-chin. 



Concepts: 


Mother 


Enemy 


Fire 


Sl&fep 


Laugh 


Iron 


Mean 


t®±8» 


. E 


1.7 ■ 


-3.3 


-3.3 


1.7 


0.7 


0.6 


-0.3 


Profile 


■ A 


1.4 


0.3 


2.7 


-3,2 


1.0 


-2.3 


0.1 


Values 


f 


-2.8 


0.3 


0.1 


-0.7 


-1.6 


4.5 


0.0 


Polarisation 


3.6 


3.4 


4*4 3*7 


2.6 


5.1 





Response Centering Measures: Mean “ 4.80, S.D. <“ 0.38 
S^ B R&ftponue Dispersion Measures: Mean «■ 1.83, S.D. ® 0.35 



SET 2. (36 scale-concept cosabinatioas) 

HI It i w 1 



Scales i 


Nice-awful, 


fair -unfair,, 


moving-still 


j, ah&rp-dttll. 


hard-soft. 


heavy- 


Concept; is : 

-Oil Ml jM#sw mii» 


Baby 


Argument 


Danger 
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?i«r 


Rock 
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Mis* 
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—2 . 8 
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3.2 


1.9 


5.0 





Mg, Ktetponae Centering Measures : Mean » 4.51, S.D. » 0.35 
Sg, Response Diaper si on Measures: Mean ® 2.0:1, S.D. « 0.42 



M X M 2 S 2 


Estimated Reliability of 
72 item Scale 


Correlations SfL . ,, 

<2 a 4/ 

S .11 .24 

1 

S, ,36 ,32 .59 

2 


! Centering : .65 

Dispersion: .74 
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In each case the nnnber of persons in the sanple is shown. Since each person rated foor concepts 
the nnber of observations is t«ss this stashes. 

In this case Che total ntssber of observations (W) is given. 



